Submitted to Robust’99, Tampere, Finland on March 5th 1999 ROBUST ASR WITH UNRELIABLE DATA AND MINIMAL ASSUMPTIONS

نویسنده

  • Martin Cooke
چکیده

Human speech perception withstands a wide variety of distortions, both experimentally applied and naturallyoccurring. A novel approach to these situations in robust ASR identifies the spectro-temporal regions which carry reliable speech evidence and treats the remainder as missing or uncertain. This standpoint makes minimal assumptions about any noise background. This paper describes two approaches to the adaptation of continuous-density hidden Markov model-based speech recognisers to deal with missing and uncertain acoustic data. The first computes output probabilities on the basis of the reliable evidence only, while the second estimates values for the unreliable regions by conditioning on the reliable parts. Both techniques are evaluated on the TIDigits corpus for several NOISEX noise sources, using spectral subtraction to identify reliable regions. These studies demonstrate that the two schemes behave comparably, and that both produce a significant performance advantage over spectral subtraction alone.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise

In this study, techniques for classification with missing or unreliable data are applied to the problem of noise-robustness in Automatic Speech Recognition (ASR). The techniques described make minimal assumptions about any noise background and rely instead on what is known about clean speech. A system is evaluated using the Aurora 2 connected digit recognition task. Using models trained on clea...

متن کامل

Robust automatic speech recognition with missing and unreliable acoustic data

Human speech perception is robust in the face of a wide variety of distortions, both experimentally applied and naturally-occurring. In these conditions, state-of-the-art automatic speech recognition technology fails. This paper describes an approach to robust ASR which acknowledges the fact that some spectro-temporal regions will be dominated by noise. For the purposes of recognition, these re...

متن کامل

Handling Missing and Unreliable Information in Speech Recognition

In this work, techniques for classiication with missing or unreliable data are applied to the problem of noise-robustness in Automatic Speech Recognition (ASR). The primary advantage of this viewpoint is that it makes minimal assumptions about any noise background. As motivation, we review evidence that the auditory system is capable of dealing with incomplete data and, indeed, does so in norma...

متن کامل

An improved opposition-based Crow Search Algorithm for Data Clustering

Data clustering is an ideal way of working with a huge amount of data and looking for a structure in the dataset. In other words, clustering is the classification of the same data; the similarity among the data in a cluster is maximum and the similarity among the data in the different clusters is minimal. The innovation of this paper is a clustering method based on the Crow Search Algorithm (CS...

متن کامل

A graphical model approach to ATLAS-free mining of MRI images

Improvements in medical imaging techniques have provided clinicians the ability to obtain detailed brain images of patients at lower costs. This increased availability of rich data opens up new avenues of research that promise better understanding of common brain ailments such as Alzheimer’s Disease and dementia. Improved data mining techniques, however, are required to leverage these new data ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999